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ABSTRACT 


Remedel as presented for the prediction of future organs 
ization size based on the numb-rs of recruits entering the 
SreanigZzauron an the past. Thais model utilizes the corremes 
tion between populations of successive time periods in 
order to better estimate the future remaining personnel in 
the system. The number of personnel leaving from each 
recruit cohort is assumed to follow the same probability 
distribution, which is a function of the age of the cohorg 
in the organization and the grade in which the cohort 
started. For large cohort sizes the total personnel in the 
system is approximately normally distributed. This result 
justifies the use of a best linear prediction method whem 
takes into account past errors of estimating the continua 
Population from one period to the next. Sensitivity eem 
predictions to errors in probability estimates is discussed. 
The model is applied to predicting university student 


enrollment. Comparison of predicted and actual student 
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enrallmant 4<.—-4~—- 
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io DER ODUC TLON 


In many large corporations and institutions with a htena 
rate of personnel turnover, a crucial problem in recruitment 
planning is that of predicting from one period to the next, 
how many personnel presently in the organization will 
remain. When the length of service of a single member is 
fixed, or completely controlled by the organization, the 
problem is trivial. However, when the length of service 
is variable, such as with middle management in large 
corporations and the military service, or university student 
Beares, probabilistic arguments must be used to estimate 
expected attrition. 

The theory of Markov chains has been widely used in 
prediction models. A basic assumption in such models is 
One of stationarity of rates of movement within a system 
Geet inea svaves. In order to assess "the Transition 
probabilities between various states in the system, it is 
necessary to identify various characteristics of personnel 
in the organization in the past to make predictions in the 
future. A number of such models can be found in the liter- 
ature, including Bartholomew [1967], and Thonstad [1968]. 

Of particular interest is a paper by McAfee [1970], 
which describes a different type of prediction model, the 
Se—called cohort model. “For this model McAfee tests the 


stationarity properties of the distribution of the remaining 


y ie 


fraction of an initial cohort size in subsequent periods 
after entry into the organization. This was done for 
three different cohorts which entered the system in three 
adjacent periods. This property will be assumed to hold 
in the prediction model of this paper. When the cohort 
sizes vary from period to period, McAfee showed that the 
Markov Models alluded to above do not accurately describe 
movement of personnel in the system. Since later in this 
Pawer Wee Considemeneweconoressuzeseas Control vari abess 
it is not meaningful to consider them constant in size Omer 
time, and hence we concentrate in this paper on the cohort 
model and analyze some of its characteristics. 

A basic assumption of this cohort model is that all 
members of a given cohort behave independently of each 
other, saga cach member's Jifetime in the system is a Tregge 
ization from a common stochastic process. These assumptions 
lead to the binomial distribution for predicting the con- 
tinuing fraction of a cohort with a given age in the system. 

The cohort prediction model views the number of per- 
sonnel in the system as a superposition of continuing 
Homi monsmeotl spest cohorts. Wesassume, that the behavichgem 
a member in one cohort is independent of, but from the same 
dis (ig mpeionwaice that cima, member ina different,.cohert. 
(By the word suena is meant different time of entry 
into the system.) To estimate the continuing portion of 
the present organization size for the next period, we sum 


Tie COMM pete lonemormrile past Cohorts, basing the 


expected number fromeach c@hort gon the smze al enn sand 
age in the organization. (viz., the sum of expected values 
of independent random variables.) 

As time elapses in the system for a given cohort, the 
number remaining in the organization from one period to 
Ghee next is dependent on the number remaining from the 
previous period. There exists a correlation, which is 
positive, between the remaining portion of a single cohort 
inweane period with the. remaining pertion,of.that same 
gehort in the previous period. This correlation,is cwumy-— 
lative when we examine the correlation between the sum of 
“continuing portions in one period with the sum of the 
eomeimuing portions,of the past period. I21 isSwthis corre- 
lation property of the model which will be of special 
siguifiicance,in improying the predictiony»characteristies 
ef Ge model, 

Another property of the model which will be proved and 
used to advantage is that with large cohort sizes, the 
MmbrraouLLon of the sum of continuime cohort porticns 
asymptotically approaches a normal distribution. This 
allows us to derive simple tractable formulae for predicting 
the number in the system in a given time period, given the 
number present in the previous time period, with no detailed 
knowledge of from which cohort the various members came. 
Without this property, the exact expression for the expected 


value of the organization size next period, given the 


reaiizgavion of the present size, is intractible. A best 
linear predictor for the expected value of the organization 
size next period is this conditional expectation when the 
organization size is a normal random variable. 

Mathematical expressions for the prediction of future 
OrgamnliZiawlon weuzicy Samewlapebie size of each»past and present 
ae aremedermived. A decomposition of the organization 
into grades is then made to predict future grade sizes 
withirm@@ive oreani zation. Finely, an application of the 
model is made to the university student enrollment problem, 
where the degm@red prediceronets thateer toval enrolimeny 
Tee data on past periods for new enrollments. This appli- 
cation is made with data from the University of California, 
Berkeley, during the period 1961 to 1969. For this model 
there are 16 lifetime. distributions which repeat yearly; 
one een for freshman, sophomore, junior and Semionm new 


students admitted into each of four academic quarters. 


10 


Lion MOREL 


Denote by X, Cu) the number of persons who enter an 
organization at time u in state i. Let K be the number of 
grades in the organization and M be the number of time 
periods (epochs) beyond which no member can remain in the 
organization. Let the probability of a single member 
remaining until at least s epochs have elapsed (since entry 
into the system), be p,(s), and let X,(u,s) be the number 
who entered in grade i at time u who are still in the organ- 
ization at time uts. Then X, (t-s,s) is the number in the 
System at time t of those that entered at time t-s in grade 
i, a binomially distributed random variable with parameters 
X, (t-s) and p,(s). The expected value of X,(t-s,s) ch 
p,(s)-X, (t-s) and the variance is X, (t-s)p,(s)q,(s), where 
q,(s) is l-p,(s). When the time elapsed since cohort entry, 
S$, reaches the value M, the expected value and the variance 
Chr X, (t-M,M) are both zero. It is assumed that the behavior 
of members of a cohort which entered the system at time t, 
is independent of those in a cohort which entered at time 
u, u#At. It is also assumed that for i# j, the behavior of 
the members of X,(t) is unaffected by that of members of 
X(t). 

Denote by Y,(t) the number of persons present at time t 
who started in grade i, and by Y(t) the entire population 


present in the organization at time t. The entire 


ih 


ae ees ee rOUmetemmit), ab time t, is the sum of all 
remaining portions of cohort sizes which started in each 
of the K grades, counting back M epochs from the present 
time. Thus we have that 

K K M 

ae a F Wet) = 2} d X, (t-s,s) 

=] i=1 s=0 

The expected value and variance of Y(t) are then respect- 


Liou yy. 


kK M 
te eee) ee) DS) 
i=l s=0 


and 


K M 
OC ar Gta ep, (s) a, (s) , 
i=l s=0 


where p, (0) = 1 and q, (0) —_ 0m 


Our ej 6ee. ve is to find expressions for the conditional 
expectations, E[Y(t+1)|Y¥(t)] and Am col) ee Ce for 
which we need the distributions of Y(t) and Y,(t). Although 
the first and second moments of Y,(t) and Y(t) have simple 
forms, explicit formulae for their distribution functions 
are very unwieldy. However, if ail initial cohort sizes 
are large for all time t, then the distributions for Y, (t) 
and Y(t) are asymptotically normal. This follows from 
Theorem One and the fact that the Y's are sums of indepen- 


dent random variables. 


re 


Theerem One: 

If {X,}, ieI, is a family of random variables, each 
independent and each distributed binomially with parameters 
Ds and N, (for O<p <1 and N, >On, where ate P,<@; then for 


al 
™ - “6 
W = ea WyRgdf G2 Ming) » the distribution of W 


asymptotically approaches that of a random variable which 


is distributed as normal (0,1) as N, rier, 


Proof: In order to prove the theorem, it is sufficient to 


show that the moment generating function, G(m), 


+5m 


approaches the limit e as N,roWiel. 


1 
= %B 
1. Let B (ie N.p.q,) 


Yel 
ad... ie ae 
a. The expansion of e is t 3, and for x small, 
j=0 7° 
x° 2 
is 1+x+-5 + o(x uF 


be Per x small, the term (1+x)” can be represented 


a Qnx 
a wer ( X,-N.p, )/B m(X.-N.p, )/B 
ae *G-aim) = Ble P= Ee = Tl Ele 
W ‘ 
L@F 
N -N.p.m/B 
= ¢ (p,e™? +q,) 1, i 
i&e I, 
N. 
q,m/B —p,m/B ais 
= It \p.e + q.e 
Tons : 
q.m/B 
3. From la, the expansion of p,e i q,m/B 


=p. 7B 
+ s(q,m/B) 1 b/s") and the expansion of q,e ae 


is q, C1 - p,m/B + ’s(p,m/B) ~ oGlGs yi. Thus, 


Ih 


‘ey 
= 
@ 
as 
QO 
D 
lt 


4 (p, tq.) 1° (p.4,-p,q. )m/B 


™ pa, (a, +p, ) (m/B)* + 0(1/B*) 


Ll + %p.¢.(m/B)* + 0(1/B°) 


4. From ib, and using the fact that the term sp. (m/B*) 
is small for large values of B, and neglecting 


o(17B"), we have 


a, m/B -p,m/B|N; p,q, N, (m/B)*/2 
p,e zt: qe re: spend henge 
2 Faye .c jeg (m/B)° 
Peo. Ne (mA) ; I iy, Saari ee 
Gi ee ee Sle aa = gifé =n 
W ; 
iB IE 


ieee md Gases N. Tanase coOntiinity for all i ing 


2 - _ 
since (1/B) C2 Pya4Ny) a ri 


It is now possible to examine how meart to estimate 
expected future values of Y and Y, when past realizations ene 
and Y, are known. When the conditional expectation of a random 
. variable cannot be found explicitly, Parzen (1960) suggests a 
best linear predictor which minimizes the mean square error of 
prediervton Using” a Mieaer tunetion of previous realizations 
When those random variables are normal, this function 
gives the exact conditional expectation of the random 
variable, given the value in the previous time periods. 
Theorem One justifies our use of the best linear predictor. 
Jil Theorem Two, we derive the general expression for this 


predictor, which we later specialize to our cohort model. 
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Theorem Two: 


A best linear predictor Ps™defined*v’o De Vial Tiree 


E*( = a + bite fom Y and X random vardebiles , 
which minimizes E[(Y-E*[Y])“], denoted Var" (Y). The expres- 


sions for E*(Y) and VAR*(Y) are 


B*(¥Y) = BCY) + [Cov(Y ,X)/Var(X))(X=n@) pene 


Var*(Y) = Var(Y) (1-p°(X,Y)), where the term o(X,Y) 


denotes the correlation coefficient of X,Y. 


eroor: 


1. Var*(Y) = E[(Y¥-(atbX))°]. 
2. = Var*(Y) = omiVe(o4bG)) =un2hii) * 2Catbaa) ; 
which when set to zero, yields a = E[Y]) - bE[X]. 
d # = = e 
a ap Var (Y) = -2E[(XY-X(atbX)] = - 2E[XY] + 2aE[X] + 2bE[X°], 
which, when set to Ov-and the substitution for a@ is made, 
E[XY] - E[XJE[Y] 


5 5 = Cow(X ,Y Wwrarik? 
E([X J] - E [X] 


yields b= 


we SUostiumting for a and b in the expression for “E#(Y% 
aad Wart(yY) . E*(Y)e= BY] = bEIxX] tepXx 
= E[Y] + (Cov[X,Y]/Var[X])(X-E[X]) and 
E[(Y-E*(Y])°] = b°E((X-E[X]]°)- 2bE[(X-£(X))(Y-E(Y))] 


Var(Y] + b“Var[X] - 2bCov[X,Y] 


Var*(Y) 


Var(Y] + [Cov(X,¥)/Var(X)1°Var[X] 


— 2 Coy eet |. / Varig cov lee Y | 


Var[Y] (1-Cov* [X,Y1]/(Var[X]Var[Y])] 


| 


Var [Y](1-9°[X,Y1]). O 


In a similar manner, E**(Y) = E(Y)+b,(X-E(X))+b,(Z-E(Z)) 
minimizes Var®*(Y) = E[(Y-E®*(Y))*°] when 

b, = D[Cov(X,Y)/Var(X) - Gov ¥,2)Cov(X,Z)/{Var(X)Var(Z) Fie 

b, = D[Cov(Z,Y)/Var(Z) - Cov(Y,X)Cov(X,Z)/{Var (X)Var(Z)}], 
and 


D = 1/(1-p°[X,2]). 


The expression for Var**(Y) then becomes 


2 
i 


2 


Var**(Y) = Var(Y) + b 


tom X)eteb., Var(2Z)+ 2b b,Cov(X,2) 


i 


~~ 2b.Cov(¥,X) - 2b 


1 Revi atoe 


2 


We now have an expression for the best linear predictor 
for Y(t) when we know the past realizations, Y(t-1) and 
Y(t-2), regardless of the distribution of Y(t). For large 
cohort sizes at entry into the system, we have that this 
predictor @lggibest Gl.e., At Gs E[Y(t)|Y(t-1),Y(t-2))), 
since the distribution of Y(t) will be very close to normal. 
Hence the functions E* and E** are actually conditional 
CxpeeCuaws ons sel Vem Gases past per1od error and the last tye 
period errors respectively. 

The only terms used in expressing E* and E** which have 
not been derived are the Cov[Y(t),Y(t-1)] and Cov[Y(t) ,Y(t-2)]. 
Since independence exists between X,(t-y,y) and X5(t-y sy) 
for i#j, and independence exists between X,(t-s,s) and 


X,(t-u,u) for u#s, the covariance between all such X terms 
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is zero except for that between X, (t-s,u) and X, (t-s,w). 
Here, for w>u, we may interpret X, (t-s,u) to be the remnants 
of the initial sizgesaueene im... X,(t-s); and X, (t-s ,w) to be 
the remnants, some w-u epochs later, of X,(t-s,u). Given 
that a member of X, (t-s) remains until a time u epochs 
later, the probability of his remaining until w epochs after 
entry into the system is p,(w)/p, Cu). (Assuming a person 
who leaves the system does not return, O<[p, (w)/p, (u)]<1 
fer"wou.) The conditional expectation of X, (t-s,w) given 
the realization of X, (t-s ,u) is then [p, (w)/p, (u)]X, (t-s,u). 
Hence the covariance of X, (t-s,u) and X, (t-s ,w) is derived 


by the following argument: 


E[X,(t-s,u),X,(t-s,w)] = [pj(w)/p, (u) JELX, (t-s,u)°I; 
Cov[X, (t-s,u) ,X, (t-s,w)] = [p, (w)/p, (uw) JELX, (t-s ,u) ©] 
- E[X, (t-s,u) ]E([X, (t-s,w)], 


where 


E[X,(t-s,u)] p, (u)X, (t-s) and 


E[X,(t-s,w)] = p,(w)X, (t-s); 
and 


E[X,(t-s,u)]E[X, (t-s,w)]= p, (w)p, (u) [X, (t-s) 1° 


I 


ip, (w)/p,(w)] fp, (u) 1° 1X, (t-s) 1° 


ip, (w)/p, (u) J{ELX, (t-s)1}°. 


al 


Hence, 


Cov{X, (t-s,u),X,(t-s,w)l = [p, (w)/p, (u) JE[X, (t-s,u)*) 


-[p, (w)/p, (u) JE* [X, (t-s,u)] 


[p, (w)/p, (u) ]Var[X, (t-s,u)] 


p, (w)q, (u)X, (t-s). 


To express the covariance between X,(t-s,s) and the remain- 
ing number of the cohort with age s-l (one period ago), 


X,(t-s,s-l), we substitute w=s and u=s-—-1 to obtain 
Cov[X, (t-s,s),X, (t-s,s-1)] = p,(s)q, (s-1)X,(t-s), for Sar 
Similarly, by substituting w=s and u=s-2, 


Cov [X, (t-s,s),X, (t-s,s-2)] = p, (s)q, (s-2)X, (t-s), for See 


Neo bain Cov[Y,(t),¥, (t-1)], the covariance existing 
between all the remnants at time t from cohorts which entered 
grade i and the remnants from the same cohorts at time t-l, 


WescUlmom cuaurom i GOminCO Obtain, 


itt = 


Cov[Y, (t),Y,(t-1)] Cov[X, (t-s,s),X,(t-s,s-1)] 


s=l 


s 


Zp, (s)q, (s-1)X, (t-s). 
s=l] 
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Similarly, (mutatis mumjandi ) 


ae 


Cov[Y,(t),¥,(t-2)] Cov[X, (t-s,s) ,X,(t-s,s-2)] 


S=2 


wus 


p,(s)q, (s-2)X, (t-s). 


S=2 


The independenee between cohorts starting in different 
grades has already been established; thus, to obtain the 
covariance between all remnants at time t with ala» remnants 
at time t-l and with all remnants at time t-2, we sum on i 
from 1 to K the covariances of the Y, (t) terms to obtain, 


Pe Speietd. vel, 


K 
Covet yyy(t-1)] = & Coviy, (ty,y¥, (t-0 
i=l 
K M 
= 5 zp, (s)q,(s-l)X, (t-s) 
i=l s=l 
and 
K 
Coviv(t) ,Y(v=2) = a Coviy, (1), (t=2)) 
1=1 
K M 
= X p.(s)q.(s-2)X. (t-s). 
{=1 s=2 + e a 


To summarize, the expressions for estimating the 


remaining persons in the system who started in grade 1 are 


Be(Y, (t)) = BLY, (t)] + by ,tY, (t-1) eMC E A) D 


nm 


where 


ig. CoviY, (t),¥, (t-1)]/VarlY, (t-1)]; 
and 
7 a: 
tee) = B[Y,(t)} + by 4{¥, (t-1) - ELY, (t-1)]) 
2 
+ by 4 {¥,(t-2) - ELY, (t-2) 1}, 
by 4 = G(A,-A,A3) and by 


pee (A,—AA,) 


A= 1/{1=p* [Y, (t-1) ,¥, (t-2)1}, 


= 
ll 


1 Ota erica) vary. (t-1)1 = Ds te 
A, = Gor eae be liyvarly, (t-2)] = bs te] 
and 
Ag = Cov[Y,(t),¥, (t-2)]/VarlY, (t-2)]. 


The CxXprespmemsefoneestimmering the number of total 


remaining persons are then 


Cry i. me tome) = E[Y(t-1))} 
and 
E*#(¥(t)) = E(¥(t)] + by{¥(t-1) - E[¥(t-1)]} 


+ pe{¥(t-2) - E[¥(t-2)]}; 
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where 


bie = Cov({¥(t) .¥(t-%) )/Var[(y(t-1))) 
K K 
= EL CowLY.4t),.¥.(t-1))/ he Vart@s(c-1p tt, 
a ; : i=l 
by, = D{b, ~ bd, _,Cov[¥(t) ,¥(t-2)]/Var(¥(t-2)1}, 
be = D{cov(¥(t) ,¥(t-2)]/Varl¥(t-2)1 = b,b,_4}; 


and 


D = 1/{1 - p°[Y(t-1) ,Y¥(t-2)]}. 


This model for predicting the total number in the system 
at time t is applied to the problem of predicting total 


student enrollment in Section V. 


aA 


ee eGo PECIRIED GOALS 


Consider an organization in which desired numbers are 
Specified for personnel in each grade i at times ttl, 

Cie, eee eo lceis the objective of this 
Secvicn VO eextenGe tiommecewmmor section II for predicting 
the future grade size within the organization. 

Define P44 (t,u) to be that fraction of personnel in 
grade j at time u given they entered the system in grade i 
at time t, where t<u. If the grades are numbered in 
hierarchal order, i<j means a promotion, i>j means a 
demotion and i= j means that p,,(t,u) ig the fract igen 
which remain in grade i in the period t to u. If the rates 
of movement between grades are stable for the time period 
being considered, MieasammorLon Of stationarity or 
independence of time t can be made about the transferred 
fractions, and py, (tu) can be expressed as p,ju-t); that 
is, lft-—mwemaculomeureansterred from grade i to j is a fumes 
tion only of the elapsed time u-t. Let X,, (tu) denote 
the number in grade j at time u of that cohort which 
entered the organization at time t in grade i. We then 
have that X,,(t,u) is a binomially distributed random 
variable with parameters X,(t) and Py ,(u-t). 

By representing the number of persons in grade j at 
time t as yJ (t) in terms of the cohort portions remaining 
from the initial size at entry into grade i in all previous 


epochs, we sum on i and sum on s to obtain 


ae 


: M «kK 
y(t) = © Ff X,,(t-s,t). 
s=0 i=l 74 


it should be noted that Xi, (t-s,t) is independent of 
X,,(t-r,t) for s#r, since the cohort entering grade i 
at time t-s acts independently of the cohort entering 
grade i at time t-r. Note also that X43 (t-s st) is inde- 
pendent of X,,(t-s,t) for i#k, since these are the rem- 
nants presently in grade j at time t of those who 
entered the system at time t-s in different grades. For 


a= 0, X, (t tow O.when 1474 and X,.(t,t) is X.(t); 


J 
that is, 2a eohort which just ,.entered the system at time & 
in grade ji will have no opportunity to diminish in sige 
or to be transferred to another grade in the same time 
period. Of note is the fact that independence does not 
exist between vy) (t) and v(t) for j #£, since in each of 
these random variables, there may exist people from the 
Some initial cohort. At this point, the distdnetion 
between Y, (t) and y Ge should be clear. In expressing 
Y,(t); we are counting the remnants of X. (t-s) in alt 
grades. In expressing yi(t), we are counting the number 
presently in grade i as remnants from all previous time 
Periods of cohort entries into all grades. 

We are now able to express the means and variances of 
yd (t) as the sum of independent means and variances 


respectively of X,,(t-s,t); 


K MM K M 
E(y’(t)) =E[r© 2 Xqu(t-sst)} = E£ © p,,(s)X, (ts) 
1=1 s=0 7d {=1 s=0 td ‘ 
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and 
j K M 
VaroiGaGi) | = ee Beis Mid 14 3° Sy t—84 for j=1,2..... a 


where 


\ = — a 
p,, 69) ie and q4;°s) 1 P4368). 


Denote by G. (ttl), G.(tt2),... the desired goals for 
the size of grade i at times ttl, tte, ... respectvivele 
Denote by Z,(tt1), Z,(tt2),... the numbers in grade i at. 
times SLT wer te.... whiten were in the system atte 
(Ce Cttietmemebt? Cle OL Che K grades of the orfganizZagrens 
The number X,(t+1) is now a controlled variable; that mex 
the number to recruit into grade i at time ttl in order to 
Omega) ott) = G. (ttl) = 2, (t+1). 

We now assume that no.demotions take place; p43, ¢s) is 
zero for j<i. At the end of each epoch, a member is 
Promoted; remains in present grade or leaves the organ 
Zation. With these restrictions placed on the system, we 


have that the expected values for Z,(t+1) and Z,(t+2) are 


a M 
Eee (tee) ] sed te X e@eeiss) ps. (s% 
= j=l s=0 J sis 
and 
" M 
Eien tcee) | = + Mex.(t-stl)p..(s), for 1 = 1,2) jm 
+ j=l s=0 J ye 
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The variance terms are 


IL M 
VarlZ, (t+1)] = re See 3 
and 
ik M 
Marah ZaGth 2) )= = Re TRA eae) TER ee 


With the same definition as in Section II for p, (s) (that 
tome tae fraction of X, (t-s) remaining in the system at 
time t), we have a restriction that Pas) = p, (s) 

fem all values of s (i.e., s=0,1,...M). 

Seppose. our problem is to avoid overmanning or under— 
Manning grade i (i=1,2,...K) in the periods ttl, t+2, when 
the recruiting for these periods must be planned ahead. 
Assume that there are penalties defined when the number 
Of personnel in each grade is above or below the goals; 
me. , ence) when the number is over the desired values and 
C, (t) when below. The objective function to be minimized 
aig 


ral K 
B,| 7 y {C,*(t+x)Max[0,G, (t+x)-X. (t+x)-Z. (ttx)] 
X=] 1-1 - s : : 


3G (tx) (056, (tx) “Ky (t4x)=24 (t42)1)) 
where E,(:) denotes the expected value of (:) over the joint 
density of Z = (2, (tt1),2,(tt1),...2,(t+1) 2, (tt2),...2,(tt2)), 


a vector of random variables with a multivariate normal 


distribution. The optimal values of X,(tt1) and X, (tt2) 


aD 


would minimize this objective function. When Che, 


the expression reduces to x 204 (tx) {G, (t+x) - X, (ttx) 
-E[Z, (tt+x)]} » Where the expected values of Z,(t+1) and 
Z.(tt2) are given above. 

We now assume that the realizations, Y,(t) ew? . «sca 
are known and proceed to determine the best predictors for 
Z,(t+1) and Z,(t+2). Assuming the normality property for. 
Z; and Yas this best predictor will be the expectation of 
Z. condi vl1cnedeon thes values of ie jee, . wos ror Une 
present time; defining E'(Z,(tt1)] as the best predictor 


Oi Z, (ttl) and E''(Z,(tt2)] as the best predictor of Z, (tt2) 


when the values of Y(t) are known, j=l1,2,...K, we have 


Eiih2, (CF 1) de = E[Z,(t+1) | Y s(t), ib eee 
and 


Bet aect te yy = E[Z,(t+2) | Y,(t), jm oe eel 


where the realizations of ¥,(t) for j>i have no effect on 
E' and E!' and are therefore not considered in the expres- 
sions. (This follows from the fact that P54 (s) is zero 
(Om icin 

With the independence of Y, (t) and Y,(t) for i#j, we 
have that Cov[Y, (t),¥,(t)] = ) for iF 4%. Since X, (t+1) is 
a control variable, we also have that Var [Y9 (t+x)]= Var (2, (tt) 
and Cov{Yd (t+x) ,Y,(t)] = Cov[Z,(t+x),¥,(t)] for x=1,2 and 
i=ieewe. ke” Extending Theorem Two ofSectiome ll ter lh fame 


E''t we thus have that 
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it 


' a a 
E'(Z,(t+1)) = B(Z,(t+1)1 + 2 adtqy,(t) - Bry, (tyI}, 


jel 
and 


a 
E''(2,(tt2)) = E(Z,(t+2)] “+ ve 


ji . 
J2y8e f(t) - BEECH), 


where 
me - CovlY"(t+1) ,¥,(t)]/Varl¥,(t)] 


and 


gi* = Cov[¥"(t+2) ,¥,(t)]/Varl¥,(t)] 


perei = 1,25...K. ~The terms E(Z,(ttx)], x=1,2, and 
VarlY;(t)] have been derived previously, leaving the terms 


Cov(Yi(t+x),Y,(t)), x=ly2, tombe derived: 


Given that a member who entered the organization at 
Came t-s in grade j was in tle organization at tame f., 
(i.e. a member of X,(t-s,s)), the probability that this 
member will be in grade i at time t+x, is Py, (stx)/p,(s). 
The conditional expectation E[X,, (t-s,stx)|X,(t-s,s)] 
(that is the expectation of the number in grade i at time 
tex OL those who started im grade j at time t-s, onmditioned 
ee chine number remaining in the organization at tame t aiho 
started at time t-s in grade j), is Pj, (stx)X,(t-s,s)/p,(s). 
Using a similar argument as that in deriving Cov[Y(t),Y(t-])] 


im SECTION Lifewe Weave that 


om 


p,,(stx) 2 
E(X,,(t-s,s+x) ,X,(t-s,5)] = om E[(X,(t-s,8))°] 


hence 
p., (S+x) 
Bes) 


Cov[X,,(t-s,s+x),X,(t-s,s)] var[(X,(t-s,s)] 


pj, (stx)a,(s) X,(t-s). 


Since independence exists between X,(t-s,s) and X,(t-r,r) 


and between X..(t-s,stx) and X..(t-r,rt+tx) for s#r, we have 


Jt alae 
: M-x 
Cov[Y (t+x),Y.(t)] = £ Cov[X.. (t-s,stx) ,X,(t-s)] 
| J s=0 J+ J 
M-x 
= ey oy Serene 


ROM elas 2 tuted , 2 ye ea 


The use of E' and E'' in place of E(Z,(tt1)) and 
E(Z,(t+2)) respectively (using the present values of 
Y,(t)); is relevant in the objective function when 
C, (t+x) = C4" (t+x) = C, (t+x), x=1,2. The variances 
(using the fact that Cov(¥,(t),¥,(t)) is zero for i#j), 


aiee 


Var'[Z,(tt1)] = VarlZ, (t+1)] + 


et 2 
t (de ) var[Y,(t)] 


(Ge res 
HI 


jal 


- edt 


cov[Y*(t#1) ,¥,(t)1] 
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and 


tL a 
Var''[Z, (t+2)]) =NarlZ.( tes 2 {(eI1)*varly.(¢)] 
ale 2 al t ‘J 


z 2g) *coviy*(t+2) ,¥,(t)1}. 

An. appdicationsei such a modek might sbensthat.wef eine 
versity system withethe, grades defined fer curnmeuv lum sand 
level (upper and lower, for instance) in which the goals 
G, (ttl) and G, (tt2) were specified to fully utilize the 
fact liuties, without, ingfddeting aglack of classrooms 
over-enrolling. The costs, C, (ttl) and C, (tte) might be 
mased.on tke losses, encumred financially, by, overstafiing 
for a belew-desired-level of class size and the_losses 
emeurred by the added administrativesburdens of rejecting 


Sacel ledestudents when overages occur. 


ae 


eae Seo ry SO ERRORS IN PROBABILITY ESTIMATION 


The fraction of remaining individuals who start in a 
grade and remain for s epochs, s=l,2,...M is used as an 
estimate of the probability of this event. In the paper by 
McAfee (1970), a statistical test is made on a sample of 
size three, to test the hypothesis that p, (s) for each 
cohort is from the same population. This is to say that 
for moderate sample sizes in estimating p,(s), we have a 
sample mean to use, which for large sample sizes, approaches 
the true value of p,(s). At best, we use the estimate and 
for this reason examine the sensitivity of the model 
described in Section II to error which may exist between 
our estimate and the true value of p,(s);5 viz., the sensi- 
tivity of the expected values, E[Y(t)], E*[Y(t)] and 
E**#(Y(t)] to errors in p,(s). We shall consider two cases: 
one, in which an error Ap, (s) exists between our estimate 
and the true value of p, (s) for some s and i; the second, 
in which an error exists for all s and i. 

Waking the partials of ELY(t)], E*[Y(t)] and E**(y(t@ 
with respect to p,(s) yields the following expressions 
(where mn denotes the partial differential of (:) with 


respect to p,(s)): 


dE[Y(t)]/dp,(s) = d/dp,(s) [X,(t-s)p,(s)] = X,(t-s); 
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dE*[Y(t)]/dp, (s) d/dp,(s) E[Y(t)] 
+b, d/dp, (s) 1Y¥(t=}) = Bi¥te=2y} 


+ {Y¥(t-1) - E[Y¥(t-1)] }d/dp, (s)(b,) 


% 
X, (t-s) = b, X,(t-s-1) + Ry a 
and 
dE**[Y(t)]/dp, (s) = d/dp, (s)E[Y(t)] 
r b, d/dp, (s) {¥(t-1) - BIv@eDo) 


“ be d/ap, (s) {¥(t-2) ~ E[Y(t-2)]]} 


+ {Y¥(t-1) - E[Y(t-1)]}d/dp, (s) (by) 
+{¥(t-2) - E[Y(t-2)]}d/dp, (s) (©) 
= X.(t-s) - b?X, (t-s-1) -b?X.(t-s-2) +R#*; 
i mel £4 ae 
where RY and Re * represent sums of remainder terms negligible 
in comparison to other terms in the expressions respectively 
for dE*/dp, (s) and dE**/dp, (s). (See the appendix for the 
exact expressions represented by R.) 
Neglecting R for small errors in the estimate for p,(s), 
the changes in predicted values are then 
dE (Y(t) ] = X,(t-s) Ap,(s), 
# = 2 Es -—e 
AE* [Y(t )} Wray - be (t-s=0)} Ape(s), 
and 


AE**(Y(t)) = {X,(t-s) - b-X, (t-s-1) az beX, (t-s-2) }Ap, (a). 


oh! 


cae woes eevecdmigernesexpresswon for E*(Y(t)), that as 
by approaches one, the error in the predicted value is 
Ap, (s) AX, , where AX, =X, (t-s)-X, (t-s-1). Since the covari- 


ance of Y(t) and Y(t-1) is always positive, b, takes on a 


value between zero and one, and b, acts as a dampening 


v 


eeeiiicient 1foreeerersmim thes predicted value of Y(t) using 
BS, 
Similarly for the expression of AE**(Y(t)), as 


(by +bo) +1, the error in the predicted value is 


d igeqe! > 2 il 
Ap, (s) (bo, AX; r by ‘i 


and Ax‘ = X, (t-s) - Pte —2 (Viz., the differences 


AX‘), where the terms AX; = X,(t-s)-X,(t-s-1) 
between the cohort entering grade i at time t-s and those 
which enter the periods before.) The dampening effect of 
the error in E**(Y(t)] is evident with the by coefficients 
(i=1,2). Thus, a smaller error in the predicted value of 
Y(t) results when E* and E** are used rather than E. 

Let us assume now that p,(s) = CS for. all dg 
we have a geometric distribution of remaining members of 


each grade. If X,(t) were a constant for all t (that is, 


X,(t) = X,), then 


K [ K 
Ef(y(t)] = 2 Efy.(t)] = 2 |] 2 X.p.°] = &£ X./(1-p.); 
te i=l]s=0 * > i=l 7 = 
K 
(Gey) = ae Mood Nel) 5), a9 Pee) 7 ee 


{Oe 


Be 


and 


ior 


and 


The 


and 


The 


N 


E**(Y(t)) = * xr) + pb, [¥(t-1) - X,/(1-p,)] 
1= 


2 
+ by [¥(t-2) - X,/(1-p, 08, 


1 | K 3 K 1 
bap [b/(1l-b)] {1 - 2% p,~X,/ 2 rl ew 
A = . i. al 
i=l i=l 

K K 
be = [1/(1-b)} by » Py 3x, ,/* = p,X, | - (pf = b@. 

i=l i=] 
resultant terms for the changes in prediction are then 


K 2 
nE(¥(t)) = 2 ap,X,/(1-p,°), 
i=l 
K 2 
MEPIY Rt) ) = of Ap, X,(1-b)/1-p, ) = (1=-b MBL Y(t)), 
i=] 
- 1.2 2 Le2 
AE*®* (Y(t) ) I Ap, X, (1-b =a Jeg ) = (1-b —-b )AE(Y(t )). 
i= 


dampening factors for AE*(Y(t)) and AE**(Y(t)) are 


immediately evident. 


Using E* or E** in place of E, we can reduce the predic- 


tion error caused by errors in p,(s). 
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vo APPLICATION TO UNIVERSITY ENROLLMENT 


The problem of predicting total student attendance at 
the University of California, Berkeley, is approached using 
the model for the best mredictoors, B*(Y(t)) and E**(Y¥(a jie 
Data was obtained from Berkeley for new student enrollments 
and is tabulated in Table I. 

[Cia SeueclUeBe cme ltenemare reguired for only the 
fall quarter of each year. To estimate the p,9(s) (the 
probability of a student remaining to the g bh Subsequent 
fall period following entry into grade i, quarter j), data 
was collected on the numbers of students from the cohorts 
entering in the Fall of 1966 and the Winter, Spring and 
Summer of 1967 which were still in attendance each succeed-— 
ing fall term. The most recent data available was from 
1969, which included at most 3 years for any cohort. To 
estimate p,°(s) for the years in attendance 4-6, we 
assumed students! attendance behavior over time is essen- 
tially stationary and used past cohort data analysis found 
in Suslow et i (1968). Our estimates of p,9(s) are given 
in Table II. it should be pointed out that during the 
yearsm toe] through 1966, the University followed a semester 
system. Starting in 1967, the University switched to a 
duaryer=sys cel. 

The parameters calculated with the data in Tables I and 


lil are shown in Table III. ‘Our estimates of total student 
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TABLES 1 i 


VALUES USED IN PREDICTION 


CASE A 
(Summer enrollees treated as Fall enrollees) 
Year 
__ 1960) oi ie eo 
Peau) 16919.5% 10539.5 Meecor. 
Var(Y(t)) ? 67.66 67.86 Tye 02 
Gey (Y(t) ,Y (t=1)) Bee 6 lies jl) 2186.0 
Gom(Y (tb), Y(t-2)) 984.5 
o0(Y¥(t),Y(t-1) 49245 =, 45358 
b, ~49400 AL Tah 
be 48641 
bé -~.02561 
ute eH eCy(t) 72, 5 51 Bee 398 
var®(¥(t))? 59.06 63.29 
Var®*(Y(t))? 63.49 
CASE B 
(Summer enrollees treated separately) 
Year 
1966 1967 1968 

BeY(t)) w69m9.5 18186.9° 17646. 4 
Var(y(t))# 67.66 68.73 73.04 
er (y(t),Y(t-1)). 2077.6 2g 6 
@eviy(t) ,Y(t=-2)) 971.9 
o(¥(t),¥(t-1)) .46828 .46704 
bt | 47577 =. 49637 
be 51040 
bé 03070 
ct )—-B( Y(t )) =l72.5 Pigori2e iWiers) 
var*(¥(t))? 60.75 64.59 
Var®*(Y(t))* 64.74 
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1969 


LOW33 . 7 
70.83 
2349.6 
970.6 
~46707 
46586 
~46055 
-.01305 
-17.742 
62,63 
62.63 


1969 
17329.6 
73.60 
Zao). | 
1077.6 
44559 
“ee 
44672 
00637 
786.42 
65.89 
65am 


enrollment in the falls of 1967 through 1969 are given in 
Table IV with actual enrollment figures for comparison. 

Due to a quota limit for fall enrollees and the start of 
year round operations in the Summer of 1968, it was felt 
that summer enrollees in 1968 and 1969 might not behave as 
summer enrollees in 1967, but in fact be early fall appli- 
cants who enrolled in the summer rather than risk unsuc- 
cessful enrollment in fall. This fact plus the relatively 
onal sample size from 1967 for estimating the probabilities 
for summer enrollees to remain in the system, leads to two 
cases for estimation: Case A, in which summer enrollees 

are treated as fall enrollees; and Case B, in which summer 
enrollees are considered separately from the fall enrollees. 


The two cases are tabulated in Tables III and IV. 


38 


TABLE IV 


PREDICTED TOTAL FALL ENROLLEE? 
(With plus or minus two standard deviations) 


CASE A 
(Summer Enrollees treated as Fall Enrollees) 
Year 
1967 1968 1969 
Estimate using E 183394135 183654142 161333285 
Estimate using E* 182564118 . 183644126 179992125 
Estimate using E** 183684126 2179624125 


CASE B 
(Summer Enrollees treated separately) 
Year 
1967 1968 1969 
Estimate using E 181864137 178484146 173294147 
Estimate using E* 16104FLDS «1VE@P+129 177032131 
Estimate using E** 179194129 173944131 


Retual Total Enrollment Ss 27 17991 18116 
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APPENDIX A 
REMAINDER TERMS TO PREDICTION ERROR 


In Section IV, expressions for dE*/dp, (s) and dE**/dp,(s) 
are given in which small remainder terms are represented by 


Re and Re respectively. The term RS represents 


U2 Ce ae) — Seer et) } G7 cot s)) where 


{X.(t-s-lq.(s-1)-X. (t-s)p. (st1)-b,_X. (t-s-1) [1- 2p. (s)]} 
db,/ dp, (s) = fe See: ee Se en a aes 
Var [Y(t-1)] 


The term Re* represents the sum, 
{¥(t-1)-EL¥(t-1)] }aby/dp, (s) + {(Y(t-2)-E[Y(t-2)] }ab</ap, (s). 


Cov[Y(t) ,Y(t—25m 
db ./dp,(s) = reso esr the Ver ett 1)}-b,_ 1 Varl¥(te2y) 6 y-Po) 


A 
* vartyCestyy ‘Ar~Aa~AgAytastag) » 


fe 
db “/dp (s) = ae Y(t-2)]- bb, _7Var [Y¥(t-1)]}(D,- Do) 


+ vaRTECecayy (Cy-S2-0g-Cy) 


where 


IL 
As = VarlY(t-1)) [X, (t-s-1)q,(s-1) - X,(t-s)p, (st1)] 


I> 
IH 


be ee) [T2p ts )) 
b [X, (t-s-2)q, (s-2) - X,(t-s)p, (st2)] 


ae b= |. 
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Camly (toa ( Ga2ee 


Ay — Varty (toy png | oan ee = X, (t-s-1)p, (st1)} 


. Cov l¥ (tj) .yY (t=25 1s VaerilGiod 
5"  Varltit—2 1 Vaniett gm Nees Maugeme =? 11 —2p4 Soa) 


Cowl¥ (t).¥ G2 


 — —_Van@@r=enme apie Saale tay 


& 


1 X, (t-s-2) {2q, (s-1) + be [2ae(s-1d 


O 
It 


Oo 
it 


2 X,(t-s-1) fap, (s41) ii ee ed ete | 


L 7 VaRIVTESEIT (%(t-5-2)a4 (8-2) ~ 4 (t-5)p, (942) 


Cov[Y(t) ,Y(t-2)] 


> = aie eae X, (t-s-2) [1-2p, (s)] 


© 
I 


= b, (kh, (tosse) iggy (s-h) = b + 2p, (s)} - X, (t-s-1)p,(s+1)] 


t-1 
Cy, = D, _7 Pe ees i) tee) bt ape Gee) J — X, (o-sekon Corl )a 


and 


P= lng( hump Ninety, Yale Qadole) « 
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